o 
O 



a 



SEQUENCE LISTING 
<110> Max-Planck-Gesellschaft e.V. 

<120> Protein expression and structure solution using 
specific fusion vectors 

<130> ST010209-EPA 

<140> 01100762.2 
<141> 2001-01-12 

<160> 3 

<170> Patentln Ver. 2.1 



F <210> 1 

Ly <211> 765 

□ <212> PRT 

UJ <213> Artificial Sequence 



» <220> 

J <223> Description of Artificial Sequence: Partial myosin 
M- sequence of Dictyostelium; Component (1) of the 

p recombinant protein M761-2R R238E 



<400> 1 



Met Asp Gly Thr Glu Asp Pro He His Asp Arg Thr Ser Asp Tyr His 
15 10 15 

Lys Tyr Leu Lys Val Lys Gin Gly Asp Ser Asp Leu Phe Lys Leu Thr 
20 25 30 

Val ser Asp Lys Arg Tyr He Trp Tyr Asn Pro Asp Pro Lys Glu Arg 
35 ™ 45 

Asp ser Tyr Glu Cys Gly Glu He Val Ser Glu Thr Ser Asp Ser Phe 
50 " 60 

Thr Phe Lys Thr Val Asp Gly Gin Asp Arg Gin Val Lys Lys Asp 



70 



75 



Ala Asn Gin Arg Asn Pro He Lys Phe Asp Gly Val Glu 



Asp 
80 



Asp Met Ser 



85 90 95 



Glu Leu Ser Tyr Leu Asn Glu Pro Ala Val Phe His Asn Leu Arg Val 
100 



1 



Arg Tyr Asn Gin Asp Leu He Tyr Thr Tyr Ser Gly Leu Phe Leu Val 



115 



.120 



125 



Ala Val Asn Pro Phe Lys Arg He Pro He Tyr Thr Gin Glu Met Val 



130 



135 



140 



Asp lie Phe Lys Gly Arg Arg Arg Asn Glu Val Ala Pro His He Phe 



145 



150 



155 



160 



Ala lie Ser Asp Val Ala Tyr Arg Ser Met Leu Asp Asp Arg Gin Asn 



165 



170 



175 



Gin Ser Leu Leu He Thr Gly Glu Ser Gly Ala Gly Lys Thr Glu Asn 
— 180 185 190 



Thr Lys Lys Val He Gin Tyr Leu Ala Ser Val Ala Gly Arg Asn Gin 
195 200 205 

Ala Asn Gly Ser Gly Val Leu Glu Gin Gin He Leu Gin Ala Asn Pro 



210 215 



220 



O Leu Glu Ala Phe Gly Asn Ala Lys Thr Thr Arg Asn Asn Asn Ser 

C 230 23 * 240 



Ser Arg Phe Gly Lys Phe He Glu He Gin Phe Asn Ser Ala Gly Phe 
245 250 255 

He ser Gly Ala Ser He Gin Ser Tyr Leu Leu Glu Lys Ser Arg Val 
260 265 2?0 

Val Phe Gin Ser Glu Thr Glu Arg Asn Tyr His He Phe Tyr Gin Leu 
275 280 285 

Leu Ala Gly Ala Thr Ala Glu Glu Lys Lys Ala Leu His Leu Ala Gly 
290 295 300 

Pro Glu Ser Phe Asn Tyr Leu Asn Gin Ser Gly Cys Val Asp He Lys 
5 310 315 320 

Gly Val ser Asp Ser Glu Glu Phe Lys He Thr Arg Gin Ala Met Asp 
325 33 0 335 

He Val Gly Phe Ser Gin Glu Glu Gin Met Ser He Phe Lys He He 
340 345 350 

Ala Gly He Leu His Leu Gly Asn He Lys Phe Glu Lys Gly Ala Gly 



355 3 6 o 365 
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y 



Glu Gly Ala Val Leu Lys Asp Lys Thr Ala Leu Asn Ala Ala Ser Thr 
370 375 380 

Val Phe Gly Val Asn Pro Ser Val Leu Glu Lys Ala Leu Met Glu Pro 
385 390 395 400 

Arg lie Leu Ala Gly Arg Asp Leu Val Ala Gin His Leu Asn Val Glu 
405 410 415 

Lys Ser Ser Ser Ser Arg Asp Ala Leu Val Lys Ala Leu Tyr Gly Arg 

420 425 430 

Leu Phe Leu Trp Leu Val Lys Lys lie Asn Asn Val Leu Cys Gin Glu 
435 440 445 

Arg Lys Ala Tyr Phe lie Gly Val Leu Asp lie Ser Gly Phe Glu lie 
450 455 460 



O Phe Lys Val Asn Ser Phe Glu Gin Leu Cys lie Asn Tyr Thr Asn Glu 

W 465 470 475 480 



Lys Leu Gin Gin Phe Phe Asn His His Met Phe Lys Leu Glu Gin Glu 
485 490 495 



Q Glu Tyr Leu Lys Glu Lys lie Asn Trp Thr Phe lie Asp Phe Gly Leu 

RJ 500 505 510 

Asp Ser Gin Ala Thr lie Asp Leu lie Asp Gly Arg Gin Pro Pro Gly 
515 520 525 

lie Leu Ala Leu Leu Asp Glu Gin Ser Val Phe Pro Asn Ala Thr Asp 
530 535 540 

Asn Thr Leu lie Thr Lys Leu His Ser His Phe Ser Lys Lys Asn Ala 
545 550 555 560 

Lys Tyr Glu Glu Pro Arg Phe Ser Lys Thr Glu Phe Gly Val Thr His 
565 570 575 

Tyr Ala Gly Gin Val Met Tyr Glu lie Gin Asp Trp Leu Glu Lys Asn 
580 585 590 

Lys Asp Pro Leu Gin Gin Asp Leu Glu Leu Cys Phe Lys Asp Ser Ser 
595 600 605 

Asp Asn Val Val Thr Lys Leu Phe Asn Asp Pro Asn lie Ala Ser Arg 
610 615 620 
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Ala Lys Lys Gly ^ ^ phe ^ Thr ^ ^ ^ ^ ^ ^ ^ 



630 



635 



640 



Gin Leu Ala Ser Leu Met Ala Thr Leu Glu Thr Thr Asn Pro His Phe 



645 



650 



655 



Val Arg Cys He He Pro Asn Asn Lys Gin Leu Pro Ala Lys Leu Glu 
660 665 670 

Asp Lys Val val Leu Asp Gin Leu Arg Cys Asn Gly Val Leu Glu Gly 
675 680 685 

He Arg He Thr Arg Lys Gly Phe Pro Asn Arg He He Tyr Ala Asp 

695 700 

Phe Val Lys Arg Tyr Tyr Leu Leu Ala Pro Asn Val Pro Arg Asp Ala 

715 720 
Glu Asp Ser Gin Lys Ala Thr Asp Ala Val Leu Lys His Leu Asn He 
725 7 30 735 

P Asp Pro Glu Gin Tyr Arg Phe Gly He Thr Lys He Phe Phe Arg Ala 

L_L 740 „ r- * 



H 740 745 

Q Gly Gin Leu Ala Arg He Glu Glu Ala Arg Glu Gin Arg 

m 755 760 765 



750 



<210> 2 
<211> 1016 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Whole sequence 
of recombinant protein M761-2R R238 E 

<220> 

<223> The protein comprises as component (1) the aa 
sequence of myosin H motor domain of 
Dictyostelium, a three aa linker region and the 
a-actinin aa sequence 

<400> 2 



Met Asp Gly Thr Glu Asp Pro He His Asp Arg Thr Ser Asp Tyr His 
5 10 15 



4 



20 25 



IS S 



« 



30 



Val Ser Asp Lys Arg Tyr He Trp Tyr Asn Pro Asp Pro Lys Glu Arg 
35 40 45 

Asp Ser Tyr Glu Cys Gly Glu He Val Ser Glu Thr Ser Asp Ser Phe 
50 55 60 

Thr Phe Lys Thr Val Asp Gly Gin Asp Arg Gin Val Lys Lys Asp Asp 
65 70 75 30 

Ala Asn Gin Arg Asn Pro He Lys Phe Asp Gly Val Glu Asp Met Ser 

- Glu Leu ser Tyr Leu Asn Glu Pro Ala Val Phe His Asn Leu Arg Val 

110 



100 105 



Arg Tyr Asn Gin Asp Leu He Tyr Thr Tyr Ser Gly Leu Phe Leu Val 
115 120 125 



Ala Val Asn Pro Phe Lys Arg He Pro He Tyr Thr Gin Glu Met Val 
f=A 130 135 140 



Asp He Phe Lys Gly Arg Arg Arg Asn Glu Val Ala Pro His He Phe 
145 150 155 160 

Ala lie Ser Asp Val Ala Tyr Arg Ser Met Leu Asp Asp Arg Gin Asn 
165 170 175 

Gin Ser Leu Leu He Thr Gly Glu Ser Gly Ala Gly Lys Thr Glu Asn 
180 185 19Q 

Thr Lys Lys Val He Gin Tyr Leu Ala Ser Val Ala Gly Arg Asn Gin 
195 200 2 o5 

Ala Asn Gly Ser Gly Val Leu Glu Gin Gin He Leu Gin Ala 



210 215 



He Leu Glu Ala Phe Gly Asn Ala 
225 230 235 



Asn Pro 

220 

Lys Thr Thr Arg Asn Asn Asn Ser 

240 



Ser Arg Phe Gly Lys Phe He Glu He Gin Phe Asn Ser Ala Gly Phe 



245 



250 



255 



He Ser Gly Ala Ser He Gin Ser Tyr Leu Leu Glu Lys Ser Arg Val 
260 265 270 

Val Phe Gin Ser Glu Thr Glu Arg Asn Tyr His He Phe Tyr Gin Leu 
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□ 



275 



280 



285 



Leu Ala Gly Al. Thr Ala Glu Glu Lys Lys Ala Leu His Leu Ala Gly 
290 295 300 

Pro Glu Ser Phe Asn Tyr Leu Asn Gin Ser Gly Cys Val Asp He Lys 

310 315 320 

Gly Val ser Asp Ser Glu Glu Phe Lys He Thr Arg Gin Ala Met Asp 



325 



330 



335 



He val Gly Phe- Ser Gin Glu Glu Gin Met Ser He Phe Lys He He 

345 



Ala Gly He Leu His Leu Gly Asn He Lys Phe Glu Lys Gly Ala Gly 
355 360 

Glu Gly Ala Val Leu Lys Asp Lys Thr Ala Leu Asn Ala Ala Ser Thr 
370 37 5 380 

Val Phe Gly Val Asn Pro Ser Val Leu Glu Lys Ala Leu Met Glu Pro 

390 3 o S 

jyo 400 

Arg Ha Leu Ala Gly Arg Asp Leu Val Ala Gin His Leu Asn Val Glu 

405 4io 415 

Lys ser Ser Ser Ser Arg Asp Ala Leu Val Lys Ala Leu Tyr Gly Arg 
420 425 430 

Leu Phe Leu Trp Leu Val Lys Lys He Asn Asn Val Leu Cys Gin Glu 
435 440 445 

Arg Lys Ala Tyr Phe He Gly Val Leu Asp He Ser Gly Phe Glu He 



455 



460 



Phe Lys Val Asn Ser Phe Glu Gin Leu Cys He Asn Tyr Thr Asn Glu 



470 



475 



Lys Leu Gin Gin Phe Phe Asn His His Met Phe Lys 



485 



490 



480 



Leu Glu Gin Glu 
495 



Glu Tyr Leu Lys Glu Lys He Asn Trp Thr Phe He Asp Phe Gly Leu 
500 5 °5 510 

Asp ser Gin Ala Thr He Asp Leu He Asp Gly Arg Gin Pro Pro Gly 
515 5 20 525 

He Leu Ala Leu Leu Asp Glu Gin Ser Val Phe Pro Asn Ala Thr Asp 



530 



535 



540 



Asn Thr Leu lie Thr Lys Leu His Ser Hi « dk. o 

545 / 13 Ser Hls Phe s er Lys Lys Asn Ala 

JOU err 

560 



555 



Lys Ty r «„ „. , Atg phe s „ Lys ^ ^ ^ ^ ^ ^ ^ 

570 575 
Tyr M a Gly ol„ v,L „ et Tyr slu Ile sln asp Ttp ^ ^ 

D85 590 

L y s Asp Pro Leu Gln G l„ Asp ^ Glu Leu cys phe ^ ^ ^ ^ 

600 605 
Asp As„ v.! v.l Thr L ys ^ phe isp pro flsn sm 

615 620 

Ala Lys Lys Gly Ala Asn Phe lie Thr Val Al* m = ^ „ 
625 1 Ala Ala Gln T yr Lys Glu 

630 63 * 640 

val Arg cys Xle Xle Pro Asn Asn Lys Gin Leu Pro Ala Lys Leu Glu 

665 670 
A*P Lys Val v.! Leu Asp G l„ Leu Ar 9 cys _ Gly „ al Leu ^ ^ 

680 685 

ne «, n. Tht Arg Lys Gly fc Pro to Mg n> ^ 

695 700 
». « Lye Ar, Tyr T ^ le „ pro ^ ^ pto ^ ^ 

/13 720 
«. Asp ser G l„ Lye Ale Thr Asp „, w ^ u Lys m , ^ ^ ^ 

730 735 

Asp Pro Glu Gin Tvr Ara Ph^ ri „ n 

yr Arg Phe Gly H e Thr Lys Ue phe phe ^ 

745 750 

Oly G ln Leu Ala Arg xle Glu Glu Ala Arg Glu Gin Arg Leu Gly ser 

760 765 

Olu Th r Lye Ser Asp Tyt Leu Lys Ar g Ala Asu G1 » Leu Val Gln 

775 780 

Trp He Asn Asp Lys Gln Ala s*r Tei , r*i ~ 

y xn Ala Ser Leu Glu Ser Arg Asp Phe Gly Asp 



785 7 90 7qs 

'« 800 



Ser lie Glu Ser Val Gin Ser Phe Met Asn Ala His Lys Glu Tyr Lys 
805 810 815 

Lys Thr Glu Lys Pro Pro Lys Gly Gin Glu Val Ser Glu Leu Glu Ala 
820 825 330 

He Tyr Asn Ser Leu Gin Thr Lys Leu Arg Leu He Lys Arg Glu Pro 
835 840 845 

Phe Val Ala Pro Ala Gly Leu Thr Pro Asn Glu He Asp Ser Thr Trp 
M 850 855 860 

g Ser Ala Leu Glu Lys Ala Glu Gin Glu His Ala Glu Ala Leu Arg He 

| 870 875 880 

W Glu Leu Lys Arg Gin Lys Lys He Ala Val Leu Leu Gin Lys Tyr Asn 

2 885 890 895 

□ Arg 116 L6U Lys L V S Leu Asn Trp Ala Thr Thr Lys Ser Val Tyr 

M 900 905 910 

U Gly ser Asn Glu Thr Gly Asp Ser He Thr Ala Val Gin Ala Lys 

jjj 915 920 925 

Leu Lys Asn Leu Glu Ala Phe Asp Gly Glu Cys Gin Ser Leu Glu Gly 



935 



940 



Gin ser Asn Ser Asp Leu Leu Ser He Leu Ala Gin Leu Thr Glu Leu 



950 



955 



960 



Asn Tyr Asn Gly Val Pro Glu Leu Thr Glu Arg Lys Asp Thr Phe Phe 
965 970 975 

Ala Gin Gin Trp Thr Gly Val Lys Ser Ser Ala Glu Thr Tyr Lys Asn 



980 



985 



990 



Thr Leu Leu Ala Glu Leu Glu Arg Leu Gin Lys He Glu Asp Ala Leu 

995 iooo 1005 

His His His His His His His His 
1010 1015 



<210> 3 
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<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for recombinant protein M761-2R R238E 



<400> 3 



atggatggta ccgaggatcc aattcatgat agaacttcag attatcacaa atacttaaaa 60 
gttaaacaag gtgattctga tttatttaaa cttactgttt cagataagag atacatttgg 120 
tct! a ,r^ ! tCCaaaaga aa ^"ca tatgaatgtg gtgaaattgt ttcagaaacc 180 
tctgattctt tcacattcaa aaccgttgat ggtcaagaca gacaagtcaa aaaggatgat 240 
ctcaat gtaatCCaat — ttcg.t ggtgtcgaag atatgtctga attatcatac 300 

acctattc" Cag ! agtttt ~*°**^ cgtgttcgtt acaatcaaga tttaatttac 360 
Q acctattcag gtctcttttt ggttgccgtc aatccattca agagaattcc aatctacact 420 

9 ™ c t g9 \ t9 :! atCtt ""Wtcgt agaagaaatg aagttgcccc acatattttc 4 

* ? atgttgCCta tcgttcaatg ttagatgatc gtcaaaatca atcactctta 540 

Latc C t!t a ! tCtggtgC ^aagact gaaaacacca aaaaggtcat tcaatatctt 600 
9 Ct *f tcaagccaat ggtagtggtg tattggaaca acaaattctc 660 

tctcattt C ! atCCttga -^tttggt aatgccaaaa ccacccgtaa caacaattca 720 
tctcgtttcg gtaaattcat tgaaattcaa ttcaacagtg ctggtttcat tagtggtgct 780 

- iittt agagaaatca cgtgtcgttt -™ z 

^ aattatcaca ttttctatca actcttagct ggtgccaccg ccgaagaaaa gaaagctctt 900 

t aotatcT, 9 g ! CCagaatC att — tac ttaaatcaaa gtggttgtgt tgatatcaaa 960 

ggtgtctctg atagtgaaga attcaaaatc actcgtcaag ctatggacat tgttggtttc 1020 

:::::::: aacaaat r aatctttaag atcattgctg gtat — -a - 

atcaaattcg aaaaaggtgc tggtgaaggt gctgtcctca aagacaaaac cgccctcaac 1140 
!atltt t r a CCgtCtttgg t9tCaatC " tcagtccttg aaaaggctct catggaacca 1200 
cgtattttag ccggtcgtga tttagttgct caacatctca acgttgaaaa ate catca 1260 

; :™ :::::r: aa a9ctctctat ggtcgtcttt tcctctggtt ggt ~ ™* 

ggttttaaaa tttt I ********** gcttacttta ttggtgtttt ggatatttca 1380 

ggttttgaaa ttttcaaagt caattcattc gaacaattat gtatcaatta taccaatgaa 1440 

gaLaaatc 3 : lit lit aacaagaaga atatcttaaa 50 

TttTeZaZ T" « tc *»«*t ggtcttgatt cacaagccac tatcgattta 1560 

attgatggtc gtcaaccacc aggtatttta gctcttttgg atgaacaatc tgttttccca 1620 
aatg Ccaccg ataatacttt aatcaccaaa ctccacagtc actttagcaa J cc ™° Q 

C t7 aaCCaCgttt CtC — gaatttggtg ttacccatta tgctggtcaa 
gaactt 9 t atg ^"^f "ggttagaa aagaacaaag atccattaca acaagatctc 10 
gaactttget tcaaagattc atcagacaac gttgtcacca aacttttcaa tgatccaaac I860 

:::::;::: c 9tg t caa t a9aa a9gtgcaaac tttatcactg 

caattagect cactcatggc nacccttgaa accaccaacc cacatttcgt tcgttgtatc 1980 

: ::::::: :r: ac : att accagccaaa ctcgaagata aagttgt - ™£ 

cgtt gcaatg gtgtcctcga aggtattcgt attactcgta aaggtttccc aaatcgtatfc 210Q 
atetatgecg atttegtcaa aegttactat ttattagctc caaacgttcc aagaga^ct 2 60 

tat 9 :;::::: : t aa : a9ccac cgatgctgtt ctcaaacatc ttaacattg * ^~ 

tategttteg gtatcaccaa gattttcttc cgtgccggtc aattagctcg tattgaaaaa 2280 
getegtgaac aacgtctagg atccgaacaa accaaatctg attatLaa aagagcca.t 23 0 
gaac cgt aatggattaa cgataaacaa gcafccacttg aafcca a tt l£££ 

tccatcgaat ctgttcaaag tttcatgaac gctcataaag aatataaaaa aacc a a 2460 
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•gt.te.et, ctgttcagc taaatt.a.a !„»,T g«tea.tg. .accggtgac 2760 
tcattggaag gtLtc... c"t g a t c " HT g Zll TV*" "'"^ 

= ~r <~ ~ : = - 

tt.aaa.ga £™ - = = - -gaa.g, 3000 
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